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Abstract 

Black-box complexity theory provides lower bounds for the runtime of 
black-box optimizers like evolutionary algorithms and serves as an inspi¬ 
ration for the design of new genetic algorithms. Several black-box models 
covering different classes of algorithms exist, each highlighting a different 
aspect of the algorithms under considerations. In this work we add to the 
existing black-box notions a new elitist black-box model, in which algorithms 
are required to base all decisions solely on (a fixed number of) the best 
search points sampled so far. Our model combines features of the ranking- 
based and the memory-restricted black-box models with elitist selection. 

We provide several examples for which the elitist black-box complexity 
is exponentially larger than that the respective complexities in all previous 
black-box models, thus showing that the elitist black-box complexity can 
be much closer to the runtime of typical evolutionary algorithms. 

We also introduce the concept of p-Monte Carlo black-box complexity, 
which measures the time it takes to optimize a problem with failure proba¬ 
bility at most p. Even for small p, the p-Monte Carlo black-box complexity 
of a function class T can be smaller by an exponential factor than its typ¬ 
ically regarded Las Vegas complexity (which measures the expected time it 
takes to optimize T). 


1 Introduction 

Black-box models are classes of algorithms that are designed to help us 
understand how efficient commonly used search strategies like evolution¬ 
ary algorithms (EAs) and other randomized search heuristics (RSHs) are. 
Several models exist, each designed to analyze a different aspect of search 
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heuristics. For example, the memory-restricted model (Droste et al., 2006; 
Doerr and Winzen, 2014a) helps us to understand the influence of the pop¬ 
ulation size on the efficiency of fhe search strategy, while the ranking- 
based black-box model (Doerr and Winzen, 2014b; Teytaud and Geliy, 2006; 
Fournier and Teytaud, 2011) analyzes how much a heuristic loses by not us¬ 
ing absolute but merely relative fitness values. 

Having been introduced to the evolutionary computation commu¬ 
nity in (Droste et al., 2003, 2006), black-box complexity is a young but 
highly active area of currenf research efforts (Teytaud and Geliy, 2006; 
Anil and Wiegand, 2009; Fournier and Teytaud, 2011; Rowe and Vose, 2011; 
Doerr et al., 2011a; Lehre and Witt, 2012; Doerr et al., 2013; Doerr and Winzen, 
2014a,b,c; Doerr et al., 2014a,b; Badkobeh et al., 2014, 2015; Jansen, 2015). The 
insights from black-box complexity studies can be used to design more effi¬ 
cient genetic algorithms, as the recent (1 -I- (A, A)) GA from (Doerr et al., 2015) 
shows. 

We contribute to the existing literature a new model, which we call the elitist 
black-box model. As the name suggests, our model is designed to analyze the 
effect of elitist selection rules on the performance of search heuristics. We do 
so by enforcing thaf fhe algorithms in this model maintain a population that 
contains only the p best so-far sampled individuals. Here, /i is a parameter 
of the model (called the memory- or population-size), while quality is measured 
according to increasing fitness values. Note that for population size p > 1, 
fhe /i best so-far sampled search poinfs may have different fitness values. On 
the other hand, if more than p search points of current-best fitness have been 
sampled, only p of them can be stored in the population. 

In the evolutionary computation (EC) commimity elitist selection is very 
common as it can be seen as a literate interpretation of the "survival of fhe 
fltfesf" principle in an optimization context. Unfortunately, this term is not 
used consistently in the EC literature. As described above, we call an algorithm 
elitist if and only if fhe nexf generation consists of the p best so-far search points. 
This is a commonly used convention in the theory of EA subcommunity. In 
contrast, other subcommunities call an algorithm elitist if and only if the next 
generation contains one of fhe besf so-far solufions. Anofher nofion defines an 
algorithm to be elitist if and only if every besf so-far search point enters the 
new population, so the next population must be larger than p if fhere are more 
than p search points of currenf-besf fitness. Einally, yet another notion of elifisf 
requires that the new population only consists of search point of fhe currenf- 
best fifness value, thus the next population must be smaller than p if there 
are less than p best so-far search poinfs. Researchers using such other notions 
would therefore rather call our elitist black-box model a black-box model with 
truncation selection. 

A short version of this work has been presented at the GECCO conference 
2015 in Madrid, Spain (Doerr and Lengler, 2015a). 
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1.1 Previous Work 


In contrast to classical complexity notions discussed in the computer science 
literature, black-box complexity focuses on so-called black-box optimizers, i.e., 
algorithms that do not have access to the function at hand other than by eval¬ 
uating possible solutions, which are referred to as search points. If C is a class of 
black-box optimizers, then the C-black-box complexity of a class iF of functions 
is measured by the expected number of function evaluations that are needed 
by a best possible algorithm in C to optimize any instance f £ J- (formal def¬ 
initions will be given in Section 2). This number is a lower boimd for the effi¬ 
ciency of any algorithm A £ C and thus helps us imderstand how adequate an 
algorithmic choice is for the given problem class J-. 

Among the most important algorithmic choices in the design of evo¬ 
lutionary algorithms are the population size, the sampling strategies (of¬ 
ten called variation operators), and the selection rules. Existing black¬ 
box models cover these aspects in the following way. While the memory- 
restricted model (Droste et al., 2006; Doerr and Winzen, 2014a) and the par¬ 
allel black-box model (Badkobeh et al., 2014, 2015) analyze the influence of 
the population-size, the unbiased model (Lehre and Witt, 2012; Rowe and Vose, 
2011; Doerr et al., 2011a, 2013) considers the efficiency of search strategies 
using only so-called unbiased variation operators. The influence of the 
selection rules have been analyzed in the comparison-based and ranking- 
based black-box model (Teytaud and Geliy, 2006; Fournier and Teytaud, 2011; 
Doerr and Winzen, 2014b), with a focus on not revealing full fitness informa¬ 
tion to the algorithm but rather the comparison or the ranking of search points. 
The idea behind these models is that, in contrast to other search strategies like 
the physics-inspired simulated annealing, many evolutionary algorithms base 
their selection solely on relative and not on absolute fitness values. By providing 
only relative fitness values, the models aim at understanding how this wors¬ 
ens the performance of the algorithms, and indeed it can be shown that for 
some fimction classes the ranking-based and the comparison-based black-box 
complexities are larger than the unrestricted ones. 

While the comparison-based and the ranking-based models provide only 
relative fitness values, they do not require the algorithms to always select the 
better ones. Search strategies that adhere to this selection rule are called eli¬ 
tist algorithms in the evolutionary computation literature. Many common and 
widely applied black-box optimization strategies like {p -|- A) EAs as well as 
local hill climbers such as Randomized Local Search (RES) are of this type. 
On the other hand, many practical algorithms intentionally keep subopti- 
mal solutions to enhance population diversity, or to better explore the search 
space (Ursem, 2002; Crepinsek et al., 2013). It has been shown that in some 
situations, specific elitist algorithms like RES or the (p -I- 1) EA are inferior 
to non-elitist algorithms (Friedrich et al., 2009; Jagersktipper and Storch, 2007; 
Oliveto and Zarges, 2015). In this paper, we go one step further and investigate 
the performance of all elitist algorithms simultaneously. 

As mentioned before, algorithms in the ranking-based and comparison- 
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based models do not need to be elitist. (Ab-)Using this, algorithms can 
be designed in both models that have much smaller rimtimes than typical 
EAs (Doerr and Winzen, 2014b,a). We will see that such algorithms can cru¬ 
cially profit from eventually giving preference to search points of fitness infe¬ 
rior to that of the current best search points. 

1.2 Our Model, New Complexity Measures, and Results 

We provide in this work a model to analyze the impact of elitist selection on 
the runtime of black-box optimizers. In this elitist black-box model the popu¬ 
lation of the algorithms may contain only search points of best-so-far fitness 
values. That is, if the population size is p, then at any point in time only the 
fi best-so-far search points (of possibly different fitness values) are allowed to 
be kept in the population. Ties may be broken arbitrarily. For example, if 
more than p search points of current-best fitness have been sampled, only (an 
arbitrary selection of) p of them can be stored in the population. All other pre¬ 
viously sampled search points are not allowed to influence the behavior of the 
algorithm any more. 

We show (Section 3) that already for quite simple function classes there can 
be an exponential gap between the efficiency of elitist and non-elitist black-box 
algorithms. As we shall see in Section 3 this even remains true if we regard 
(1+1) memory-restricted unary unbiased comparison-based algorithms, which 
constitutes the most restrictive combination of the existing black-box models. 
We will see that such algorithms can crucially profit from eventually giving 
preference to search points of fitness inferior to that of the current best search 
points. We also show (Section 4) that some shortcomings of previous models 
can be eliminated when they are combined with an elitist selection require¬ 
ment. More precisely we show that the elitist unary unbiased black-box com¬ 
plexity of JUMP^ is of order fl((^"]^)) and thus non-polynomial for k = uj{l). 
In contrast, the unary unbiased black-box complexity of JUMP^u is known to be 
polynomial even for extreme values of k (Doerr et al., 2014a). 

In previous models, the black-box complexity has been defined in a Las Ve- 
gas manner, that is, it measures the expected number of function evaluations 
until the algorithm hits the optimum. On the other hand, many results in the 
black-box complexity literature are based on algorithms that with high (or con¬ 
stant) probability find the optimum after a certain number of steps, and then 
random restarts are used to bound the expected runtime. In (the strict version of) 
the elitist model, algorithms are not allowed to do random restarts since new 
search points can be kept in the population only if they are among the /i best 
ones sampled so far. Since this is a rather artificial problem (many real-world 
optimization routines make use of restarts), we introduce in this work the con¬ 
cept oi Monte Carlo black-box complexities. Roughly speaking, the p-Monte Carlo 
black-box runtime of a black-box algorithm A on a function / is the minimal 
number of queries A needs in order to find the optimum of / with probability 
at least 1 — p. The complexity class is then derived in the usual way, cf. Sec¬ 
tion 2.1. We regard in our work both Monte Carlo complexities and standard 
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(i.e.. Las Vegas) complexities. For elitist black-box algorithms these two notions 
can differ substantially as we shall see in Section 3.1. 

In the following we consider only discrete search spaces, and even more 
restrictively, only pseudo-Boolean functions / : {0,1}" —K. However, gener¬ 
alizations to non-finite or continuous search spaces are straightforward. 


2 The Elitist Black-Box Model 

The elitist black-box model covers all algorithms that follow the pseudo-code in 
Algorithm 1. To describe it in a more detailed fashion, a (p -I- A) elitist black¬ 
box algorithm is initialized by sampling p search points. We allow these search 
points to be sampled adaptively, that is, the i-th sample may depend on the 
ranking of the first i — 1 search points, where, obviously, by ranking we regard 
the ranking induced by the fitness fimction f} In each subsequent roimd a 
(p -f A) elitist black-box algorithm samples A new search points from distri¬ 
butions that depend only on the current population X and the ranking of X. 
Note that in such an optimization step the offsprings do not need to be inde¬ 
pendent of each other. Assume, for example, that we create an offspring x by 
random crossover, i.e., we take some parents from the current population and 
set the entries of x by choosing (in an arbitrary way) some bit values from these 
parents; then it is allowed to also create another offspring y from these parents 
whose entries pi in those positions i in which the parents do not agree equal 
l — Xi- These two offsprings are obviously not independent of each other. How¬ 
ever, we do require that the offsprings are created before any evaluation of the 
offsprings happens. That is, the fc-th offspring may not depend on the ranking 
or fitness of the first k — 1 offsprings. (We have decided for this version as we 
feel that it best captures the spirit of EAs such as the (p -I- A) EA that can process 
the A offsprings in parallel.) When all A search points have been generated, the 
algorithm proceeds to the selection step. In this step, the algorithm sorts the 
p -I- A search points according to their fitness ranking, where it is free to break 
ties in any way. Then the new population consists of the p best search points 
according to this ordering. 

The elitist black-box model covers many common EAs such as {p -\- A) EAs, 
Randomized Local Search (RLS), and other hill climbers. It does not cover algo¬ 
rithms with non-elitist selection rules like tournament or fitness-proportional 
selection. 

Several extensions and variants of the model are possible, including in par¬ 
ticular one in which the p first search points carmot be sampled adaptively, 
where the selection has to be unbiased among search points of the same rank, 
where only offsprings can be selected (comma strategies), or a non-ranking- 
based version in which absolute instead of relative fitness information is pro¬ 
vided. Note that the latter would allow for fitness-dependent mutation rates, 

^Two search points have the same rank if and only if they have the same fitness. The search 
points of X with maximal /-values are rank one, the ones with second largest /-values are rank 
two etc. 
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Algorithm 1: The (/i + A) elitist black-box algorithm for maximizing an 
unknown function / : {0,1}" — 

1 Initialization: 

2 A ^ 0; 

3 for f = 1,..., /i do 

4 Depending only on the multiset X and the ranking p{X, /) of X 
induced by /, choose a probability distribution over (0,1}" 
and sample according to ; 

5 _ A^AU{x«}; 

6 Optimization: for t = 1,2,3,... do 

7 Depending only on the multiset X and the ranking p{X, f) of A 
induced by / choose a probability distribution p(‘) on ({0, 

and sample ..., according to 

8 Set A ^ A U ..., 

9 for t = 1,..., A do Select a; £ argmin A and update A ■«— A \ {x}; 


which are excluded by the variant analyzed here. The lower bormds presented 
in Sections 3 and 4 actually hold for this non-ranking-based model (and are 
thus even more powerful than such only applicable to the model described in 
Algorithm 1). The model can certainly also be extended to an unbiased elitist 
one, in which the distribution p^*^ in line 7 of Algorithm 1 has to be unbiased 
in the sense of Lehre and Witt (Lehre and Witt, 2012). See Section 4 for results 
on the unbiased elitist model. 

Note that elitist black-box algorithms covered by Algorithm 1 are memory- 
restricted in the sense of (Droste et al., 2006; Doerr and Winzen, 2014a), that 
is, they cannot store any other information than the current population and 
its ranking. All information about previous search points (e.g., their num¬ 
ber) has to be discarded. The (1+1) version of the elitist model is comparison- 
based (that is, a query reveals only if an offspring has worse, equal, or bet¬ 
ter fitness than its parent), while the (y -I- A) versions are ranking-based in the 
sense of (Doerr and Winzen, 2014b). This means that the algorithm has no 
information about absolute fitness values, but it knows how the fitness val¬ 
ues of the (y -I- A) search points compare to each other. To stress the differ¬ 
ence between the latter two models, we remark the following: For y > 1 or 
A > 1 the ranking-based black-box model provides more information than the 
comparison-based one as it gives a full ranking of all current search points, 
while in the comparison-based we always have to select two search points 
which are compared against each other. The ranking-based black-box com¬ 
plexity can thus be smaller by a logarithmic factor than the comparison-based 
complexity.^ However, in the memory-restricted black-box models (with con- 


^Binary search shows that the gap between the two notions cannot be larger than a logarithmic 
factor, but on the other hand this logarithmic gap occurs already for the simple OneMax problem. 
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stant /i and A), the ranking-based and the comparison-based black-box com¬ 
plexities are asymptotically equal. 

2.1 Monte Carlo vs. Las Vegas Black-Box Complexities 

As discussed in Section 1.2, usually the black-box complexity of a function class 
T is defined in a Las Vegas marmer (measuring the expected number of function 
evaluations), while in the case of elifist black-box complexity we also introduce 
a p-Monte Carlo black-box complexity, where we allow some failure probability 
p (see below for formal definitions). If we make a sfatement about the Monte 
Carlo complexity without specifying p, fhen we mean fhat for every constant 
p > 0 the statement holds for fhe p-Monfe Carlo complexify. However, we 
sometimes also regard p-Monte Carlo complexities for non-consfant p = p(n) = 
o(l), thus yielding high probability statements. 

For most black-box complexities, the Las Vegas and the Monte Carlo no¬ 
tions are closely related: every Las Vegas algorithm is also (up to a factor of 
1/p in fhe rrmfime) a p-Monte Carlo algorithm by Markov's inequality, and a 
Monte Carlo algorithm can be turned into a Las Vegas algorithm by restarting 
the algorithm until the optimum is found. In parficular, if resfarts are allowed 
then Las Vegas and Monte Carlo complexities differ by at most a constant fac¬ 
tor. This has been made explicit in (Doerr et al., 2014b, Remark 2) and is heavily 
used there as well as in a number of other results on black-box complexity. It is 
not difficult to see that such a reasoning fails for elifisf black-box algorifhms, as 
they are not allowed to do arbitrary restarts: if fhe sampled solution infended 
for a resfarf is nof as good as the ones currently in the memory, it has to be 
discarded (line 9 of Algorifhm 1). Las Vegas and Monte Carlo elitist black-box 
complexities may therefore differ significantly from each ofher, see Section 3.1 
for an example wifh exponentially large gap. 

We come to the formal definition. Let be a class of pseudo-Boolean func¬ 
tions, and let p € [0,1). The Las Vegas complexity of an algorithm A for J' is fhe 
maximum expected number of function evaluations of / before A evaluates an 
optimal search point for the first time, where the maximum is taken over all 
f £ IF. The Las Vegas complexity of F wifh respect to a class A of algorifhms 
is fhe minimum ("besf") Las Vegas complexify among all A £ A for F. The 
p-Monte Carlo complexity of F with respect to A is the minimum number T 
such that there is an algorithm in A which has for all f £ F a probabilify of 
at least 1 — p to find an optimum within the first T function evaluations. The 
(/i -I- A) elitist Las Vegas (elitist p-Monte Carlo) black-box complexity of F is fhe Las 
Vegas (p-Monte Carlo) complexity of F with respect to the class of all (p -I- A) 
elitist black-box algorithms. 

To ease terminology, we will say that an algorithm spends time t on a func¬ 
tion / if if uses at most t function evaluations on /. Moreover, we call the 
runtime of an algorithm A on a function / the random variable describing the 

While it's ranking-based black-box complexity is only 0(n/logn) (Doerr and Winzen, 2014b), its 
comparison-based complexity is ©(n) by a straightforward application of Yao's Principle. 
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number of function evaluations of A until it evaluates for the first time an op¬ 
timal search point of /. In this way, the Las Vegas complexity of A on is the 
worst-case (over all f G F) expected runtime of the best algorithm A G A. 

If we are interested in the asymptotic p-Monte Carlo complexity of an algo¬ 
rithm A on a function class F then we will frequently make use of the following 
observation, which follows from Markov's inequality and the law of total ex¬ 
pectation. 

Remark 1. Let p G (0,1). Assume that there is an event E of probability ps < p such 
that conditioned on -if the algorithm A finds the optimum after expected time at most 
T. Then thep-Monte Carlo complexity of Aon f is at most {1 — p£){p — ps)~^T. In 
particular, ifp — p£ = fl(l) then the p-Monte Carlo complexity is OlT). 


Proof. Let R be the expected runtime of on /. By Markov's inequality and 
the law of total expectation we have 


Pr 

f — Ps 

R> -T 

< Pr 

f — Ps 

R> -T 



L p-PS J 


[ P-Ps 



• Pr hf ] -F Pr [£] 


^ P-PE ^ , 

< - (1 - PS) + PS = P- 

f-PE 


□ 


2.2 (Non-)Applicability of Yao's Principle 

A convenient tool in black-box complexity theory is Yao's Principle. In simple 
words, Yao's Principle allows to restrict one's attention to bounding the ex¬ 
pected runtime T of a best-possible deterministic algorithms on a random input 
instead of regarding the best-possible performance of a random algorithm on 
an arbitrary input. Analyzing the former is often considerably easier than di¬ 
rectly boimding the performance of any possible randomized algorithm. Yao's 
Principle states that T is a lower bound for the expected performance of a best 
possible randomized algorithm for the regarded problem. In most applications 
a very easy distribution on the input can be chosen, often the imiform one. For¬ 
mally, Yao's Principle is the following. 

Lemma 2 (Yao's Principle (Yao, 1977)). Let Abe a problem with a finite set X of 
input instances (of a fixed size) permitting a finite set A of deterministic algorithms. 
Let p be a probability distribution over X and q be a probability distribution over A. 
Then, 


min E[T(/p, A)] < maxE[T(J, A^)], (1) 

where Ip denotes a random input chosen from X according to p, Aq a random algorithm 
chosen from C according to q and T{I, A) denotes the runtime of algorithm A on input 

I. 


It is interesting to note that the informal interpretation of Yao's Principle 
given above does not apply to elitist algorithms. To illustrate this phenomenon. 
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let us consider the (1+1) elitist model, though the argument can be easily ex¬ 
tended to population-based elitist algorithms. Let p be the uniform distribution 
over the instances 


Om^ : {0, IR,x 1 -^ n - ^ (xi 0 Zi), (2) 

z € {0,1}", of the well-known OneMax problem (see Section 3 for some back¬ 
ground on this problem). Let A be any deterministic algorithm. Then we will 
show that A has a positive (in fact, fairly large) probability during the opti¬ 
mization of Ip of getting stuck in some search point x‘. as a deterministic (1+1) 
elitist algorithm there exists a search point y = y{x) such that whenever the 
algorithm sees x in the memory it samples y next. If the OMz-fitness of y is 
strictly smaller than that of x, offspring y has to be discarded immediately, in 
which case the algorithm is in exactly the same situation as before. It can thus 
never escape from x, and the expected runtime of the algorithm on Om^ is in¬ 
finite. It remains to show that this situation happens with positive probability. 
Assume that the first two search points that A queries are x and y. Note that 
A does not obtain any information from querying x, so y is independent of the 
fitness function. Moreover, we may assume x ^ y. Then there are at most 2"“^ 
search points z such that OM2(x) = OMz{y). Moreover, by symmetry (and 
the uniformity of p) half of the at least 2"“^ remaining search points satisfy 
OMz(x) < OMz( 2 /), so a runs into an infinite loop with positive probability. 
Thus every deterministic (1+1) elitist algorithm has an infinite expected run¬ 
time on a uniformly chosen OneMax instance. The lower bound in (1) is thus 
infinite, too, suggesting that the elitist black-box complexity of this problem 
is infinite as well. However, there are simple elitist randomized search strate¬ 
gies that have finite expected runtime on OneMax, for example, RLS and the 
(1 + 1) EA. 

Why does this example not contradict Yao's Principle? Reading Lemma 2 
carefully, we see that it makes a statement only about such randomized algo¬ 
rithms that are a convex combination of deterministic ones. In other words, 
the randomized algorithms (on a fixed input size) are given by making one 
random choice at the beginning, determining which of the finitely many deter¬ 
ministic algorithms we apply. For typical classes of algorithms every random¬ 
ized algorithm is such a convex combination of deferminisfic algorithms (and 
randomized algorithms are, in fact, often defined this way). In this case Yao's 
Principle can be summarized in the way we described before Lemma 2, i.e., 
as a statement that links the worst-case expected runtime of randomized algo¬ 
rithms with the best expected runtime of deterministic algorithms on random 
input. The previous paragraph, however, explains that in the elitist black-box 
model there are randomized algorithms which cannot be expressed as a convex 
combination of deterministic ones. For this reason, we can never apply Yao's 
Principle directly to the class of elitist black-box algorithms. Similar consid¬ 
erations hold for other classes of memory-restricted black-box algorithms, but 
have not been mentioned explicitly in the literature. We are not aware of any 
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other class of algorithms where such an anomaly occurs and find the putative 
non-applicability of Yao's Principle quite noteworthy 

Due to the problems outlined above, we will often consider in our lower 
bound proofs a superset of algorithms which contains all elitist ones and 
which has the property that every randomized algorithm in can be ex¬ 
pressed as a convex combination of deterministic ones. A lower bound shown 
for this broader class trivially applies to all elitist black-box algorithms. Ob¬ 
serve in particular that in a class of black-box algorithms where every algo¬ 
rithm knows the number of previous steps, every randomized strategy is a 
convex combination of deterministic strategies: the algorithm can just flip all 
coins in advance, and essentially use the i-th random bit (or bit string) in the 
i-th step. Hence, such classes may be used to apply Yao's Principle. Note that 
the lower bounds obtained in the subsequent sections are still much stronger 
than those for any of the previous black-box models. 


3 Exponential Gaps to Previous Models 

We provide some function classes for which the elitist black-box complexity is 
exponentially larger than their black-box complexities in any of the previously 
regarded models. In particular, the black-box complexity will still be small in 
a model in which all algorithms have to be unbiased, memory-restricted with 
size bound one, and purely comparison-based. This shows that our model 
strengthens the existing landscape of black-box models considerably. The ex¬ 
ample will also show that the Las Vegas complexity of a problem can be expo¬ 
nentially larger than its Monte Carlo complexity. 

3.1 Twin Peaks 

We first describe a type of landscapes for which the elitist black-box complexity 
is exponentially large. The following theorem captures the intuition that elitist 
algorithms are very bad if there are several local optima that the algorithm 
needs to explore in order to determine the best one of them. This remains true 
if we grant the algorithm access the absolute (instead of the relative) fitness 
values, as we will show in Remark 4. 

Theorem 3. Let £ > 0. Let tF be a class of functions from {0,1}” to K such that for 
every set {zi, Z 2 } C {0,1}” with zi Z 2 , 

• there is a function fzi,z 2 S F such that zi is the unique global optimum, and Z 2 
is the unique second-best search point of 

• F also contains the function /'^ that is obtained from fzi,z 2 switching the 

fitness of zi and Z 2 . More formally, is defined by = fz^,z 2 {z 2 ), 

fk,z 2 i^- 2 ) = fz^,z 2 {zi), and = fz^,z 2 {z) for z € {0,1}” \ {^^i, ^: 2 }. 

Then the (1+1) elitist Las Vegas black-box complexity and the (1/2 — e)-Monte Carlo 
black-box complexity ofF are exponential in n. 
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Proof of Theorem 3. To give an intuition, we first give an outline of the proof that 
is not quite correct. Assume that a black-box algorithm encounters either 
or /'^ By definition of /'^ it does not know the global optimum before 
querying either zi or Z 2 . It thus needs to query either or Z 2 first. Assume 
that it queries zi first. Then if the algorithm is imlucky (if zi is not the global 
optimum, i.e., the algorithm optimizes /'^ the algorithm is stuck in a local 
optimum which it cannot leave except by sampling the optimum Z 2 - Due to the 
memory restriction the algorithm has lost any information about the objective 
function except possibly that is one of the two best search points. But since 

^ G F for all 2 ; G {0,1}" \ {zi}, the algorithm would then have lost any 
information about Z 2 , and would still have test 2” — 1 possible optima. 

Unfortunately, this intuitive argument fails: after querying zi the algorithm 
does have some information about Z 2 , despite the severely restricted memory. 
For example, consider an algorithm ^^2 that only queries zi if either it has 
already tested the second-best search point, or if it has identified the function 
to be fzi,z 2 or fz^^z 2 ■ ^ particular, for every z ^ ^ 2 / in order to optimize fz^,z or 

^ the algorithm queries z before querying zi. Then never takes zi into 
memory, unless either zi is the optimum or Z 2 is the optimum. In particular, 
if soes zi in its memory, and zi is not optimal, then it can query the 

global optimum in the next step. So an algorithm can draw information from 
the order in which it queries zi and Z 2 - However, informally this information is 
limited to one bit. Therefore, the algorithm can not gain much, and the intuitive 
argument outlined at the beginning still works approximatively. 

To turn this intuition into a formal proof, we employ Yao's Principle 
(Lemma 2). As described in Section 2.2, we need to consider a larger class 
A of algorithms defined as follows. Assume the algorithm has to optimize 
fzi,z 2 or /'^ 22 - call the time until the algorithm queries for the first time zi 
or Z 2 the "first phase", while we call the remaining time the "second phase". 
Since we are not too much interested in the time that the algorithm spends in 
the first phase, we simply give away to the algorithm the set {/zi,z 2 i /zi zai- 
That is, during the first phase the algorithm knows everything about the objec¬ 
tive function except which of the two points zi or Z 2 is the optimum. We also 
give the algorithm access to unlimited memory throughout this phase. During 
this phase every randomized algorithm is a convex combination of determin¬ 
istic ones. So we may use Yao's Principle, choose a probability distribution 
on T, and restrict ourselves to an algorithm A that is deterministic in the first 
phase. For the probability distribution on T, we choose a set {zi, Z 2 } of two 
n-bit strings imiformly at random, and then we pick either /zi,z 2 or f'^^ each 
with probability 1/2. 

Note that in the first phase the algorithm does not gain any additional in¬ 
formation by querying any search point z ^ {zi,Z 2 } since it can predict the 
fitness value of z without actually querying it. We may thus assume that the 
first query of A is either zi or Z 2 . Let C be the set of all sets {zi,Z 2 }, where 
zi,Z 2 G {0,1}” and zi ^ Z 2 . In the first phase the algorithm A essentially as¬ 
signs to each set {zi, Z 2 } G C either zi or Z 2 . Let us denote the corresponding 
function by Ha ■ C ^ {0,1}”. With probability 1/2, hA{zi,z 2 ) is the global 
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optimum, and with probability 1/2 it is not. 

With probability 1/2 the algorithm enters the second phase, in which we 
no longer allow it to access anything but the current search point and possi¬ 
bly its fitness. For the sake of exposition, we first consider the case that the 
algorithm may not access the fitness, and describe afterwards how to change 
the argument otherwise. The algorithm A can be randomized in this second 
phase. Recall that the instance is taken uniformly at random, and that A sam¬ 
ples zi whenever Z 2 G := h^^{zi). Therefore, conditioned on seeing zi, the 
global optimum is uniformly distributed in Cz^. The algorithm hence needs an 
expected number of VL[Czi) additional queries to find Z 2 , and the probability to 
find the optimum with a\Cz^ \ additional queries is at most a. 

It remains to show that Cz^ is large with high probability. Let p > 0. Note 
that the average size over all zi (not the expectation over all instances!) of Cz^ 
is E ■= |C|/2” = (2" - l)/2. Let D := {zi € {0,1}” I \Cz^ < pE}. Then 
\h'^^{D)\ < 2^pE < p\C\. Since the random instance is chosen uniformly af 
random from T, the set {^ 1 ,^ 2 } G C is also uniformly at random, and with 
probability at least 1 — p an instance from C\h'^{D) is chosen, and thus | > 
pE. Thus for every p > 0, conditioned on entering the second phase \Czi \ > 
pE = n{p2^) with probability at least 1 — p. Choosing somewhat arbitrarily 
a = p = n~^ shows that with probability at least 1/2 — o(l) the algorithm 
needs at least fl(2"/n) steps. This concludes the proof. □ 

Remark 4. Theorem 3 essentially also holds if we allow the algorithms to access ab¬ 
solute fitness values. More precisely, let T he a class of functions as in 3, and let 
V{E) := max {f{z) \ f £ E, z £ {0, 1}”, z is not a global maximum of f} be the set 
of all second-best fitness values. IfV (E) has subexponential size, then the (1+1) elitist 
Las Vegas black-box complexity and the (1/2 —e)-Monte Carlo black-box complexity of 
E remain exponential even if the algorithms have access to the absolute fitness values. 

Proof. The same proof as for Theorem 3 still works, only that for every a £ 
V{E) welefC^i^o := {z 2 £ Cz^ \ fz^.z^izi) = a}. This partitions into V(^E) 
subsets, and since |1^(.F)| = 2°^’^f on average these sets are still exponentially 
large. The theorem now follows in fhe same way as before, wifh fhe sefs Czi 
replaced by □ 

The Double OneMax Problem Theorem 3 provides us with landscapes that 
are very hard for elitist algorithms. We now give a more concrete example, the 
class of double OneM AX functions. This class is of fhe type as described in The¬ 
orem 3, but at the same time it is easy for a very simple non-elitist algorithm, 
namely a variant of RLS using restarts, cf. Algorithm 3. The basis for double 
OneMax functions is OneMax, one of fhe best studied example functions 
in the theory of evolufionary compufation. The original OneMax fimction 
simply counts the number of ones in a bifstring. Maximizing OneMax thus 
corresponds to finding fhe all-ones sfring. 

Search heuristics are fypically invariant with respect to problem encoding, 
and as such they have the same expected runtime for any function from the 
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generalized OneMax function class OneMax := {Om^ | z G {0,1}”}, where 
Om^ is defined by (2). We call z, the unique global optimum of function Om^, 
the target string of Om^. 

A very simple heuristic optimizing OneMax in 0(ri log n) steps is Random¬ 
ized Local Search (RLS). Since a variant of RLS will be used in our subsequent 
proofs, we give its pseudo-code in Algorithm 2. RLS is initialized with a imi- 
form sample x. In each iteration one bit position j G [n] is chosen uniformly at 
random. The j-th bit of x is flipped and the fitness of the resulting search point 
y is evaluated. The better of the two search points x and y is kept for future 
iterations (favoring the newly created individual in case of ties). As is easily 
verified, RLS is a unary unbiased (l-i-l) elitist black-box algorithm. 


Algorithm 2: Randomized Local Search for maximizing /: {0,1}" —R. 

1 Initialization: Sample x G {0,1}" uniformly at random and query f(x); 

2 Optimization: for f = 1,2,3,... do 

3 Choose j G [n] uniformly at random; 

4 Set ^ X 0 e” and query f{y);// mutation step 

5 if f{y) > f{x) then x y, ! I selection step 


We are now ready to define the double onemax functions. For two different 
strings z\,zi G {0,1}”, let 


OM^;^ ^Z‘2 (^) 


max{OM^j(a:), 0X1^2 (x)}, ifx^Zi, 
n + 1 , otherwise. 


The unique global optimum of this function is zi, and zi is the rmique second 
best search point. For all a; ^ {- 21 ,- 22 } the fitness 0 x 12 ^^ 22 ( 3 ;) equals OM 22 , 2 i(a:). 
Unless the algorithm queries either zi or zi it can therefore not distinguish 
between the two frmctions. We consider the class of frmctions T := {OMz^^z 2 \ 
-21, Z 2 G {0,1}", zi ^ Z 2 } and show the following. 

Theorem 5. Let e > 0. The (1+1) elitist (1/2 + e)-Monte Carlo black-box complexity 
of T and its unary unbiased, (l+l)-memory restricted, comparison-based black-box 
complexity is 0{n\ogn), while the (1+1) elitist Las Vegas black-box complexity of (F 
and its (1/2 — e)-Monte Carlo black-box complexity are exponential in n even if we 
allow the algorithms to access absolute fitness values. 

Proof of Theorem 5. The class (F satisfies the conditions from Theorem 3, so the 
lower bound for the (l-i-l) elitist Las Vegas black-box complexity of F' follows 
immediately. For the upper bound, consider the random local search algo¬ 
rithm (RLS) with random restarts as given by Algorithm 3. This algorithm is 
initialized like RLS. The only difference to RLS (Algorithm 2) is that during the 
optimization process, instead of mutating the current best search point, it may 
restart completely by drawing a point y uniformly at random from ( 0 , 1 }" and 
replacing the current best solution x by y regardless of their fitness values. We 
show that this algorithm has expected optimization time 0(n log n). 
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Algorithm 3: Randomized Local Search with random restarts 

1 Initialization: Sample x G {0,1}" uniformly at random and query /(x); 

2 Optimization: for f = 1,2,3,... do 

3 With probability l/(10n logn) sample y € {0,1}" uniformly at 
random and replace x byy; 

4 else 

5 Choose j € [n] uniformly at random; 

6 Set y x(B and query f{y ); //mutation step 

7 if f{y) > f{x) then x ^ y, / /selection step 


Whenever x ^ {zi,Z 2 } then the one-bit flip has probability at least n — 
f{x) to increase the fitness of x (this can be proven by an easy case distinction 
whether or not OM 2 j(a:) > OMz 2 (x)). This is at least as large as the progress 
probability for OneMax. Therefore, if no restart happens in, say, 5n log n steps 
(which is true with constant probability) then with high probability RLS finds 
either zi or Z 2 in this time. Note that there search space is 2-vertex transitive, 
i.e., there is an automorphism of the search space that maps zi to Z 2 and vice 
versa. By definition of the same automorphism maps 

to OMz2,zi {x). Hence, since RLS with restarts is an unbiased algorithm, it will 
reach zi before Z 2 with probability 1/2, and similarly vice versa. Thus, when 
the algorithm queries either zi or Z 2 , then it finds the global optimum with 
probability 1/2. Summarizing, after each restart, the algorithm has at least a 
constant probability to find the global optimum in the next 5n log n steps. This 
proves both upper bounds in Theorem 5. 

□ 


3.2 Hidden Paths 

We provide another example with an exponential gap between elitist and non¬ 
elitist black-box complexities, which gives some more insight into the dis¬ 
advantage of elitist algorithms. We use essentially the OneMax function, 
patched with a path of low fitness that leads to the global optimum. In this 
example, every elitist algorithm fails with high probability to find the opti¬ 
mum in polynomial time, since it is blind to all search points of small fitness 
value. Both the Monte Carlo and the Las Vegas elitist black box complexity of 
the problem are exponential in n, so that (unlike the example from Section 3.1) 
the problem carmot be easily mended by allowing restarts. On the other hand, 
there are memory-restricted, unary unbiased (but not elitist) algorithms that 
solve the problem efficiently. 

For 2 G {0, 1}", let z be the bitwise complement of z, i.e., Zi = 1 — Zi for all 
i G [n\. Let further = {i = (ii,...,if) G [n]^ | ii,..., if pairwise distinct}. 
To each iGXg and each z° G {0,1}", we associate a path P(z°, i) = (z°, ■ ■ ■ ,z^) 
of length £ as follows. For j G [i], let z^ G {0,1}" be the search point obtained 
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from by flipping the i^-th bit. Note that differs from z'^ in exactly j bits. 

Now we regard the set of all OneMax functions Om^ padded with a path 
of length n/4 starting at the minimum := z'^{z) = z and leading to the 
unique global maximum 2 ”/^. Formally, let := {Om^ ^ | z € {0, l}”j* G 
2 ^ 71 / 4 }/ where for i G and P(z°, i) = (z °,..., z^), 

fn + OM 2 (a;), ifx^F(z°,i), 

Om^ ^(x) := < j, if X = z-^ for 0 < j < £, 

[ 274 + 1 , if X = z^. 

Theorem 6. The unary unbiased (1+1) memory-restricted black-box complexity of 
T is O(n^), while its (1+1) Monte Carlo (and thus, also Las Vegas) elitist black-box 
complexity is also for the non-ranking-based version of the elitist model in which 

full (absolute) fitness information is revealed to the algorithm. 

Proof. For the upper bound, we need to describe a memory-restricted unary 
unbiased black-box algorithm A that optimizes f G F in quadratic time. The 
algorithm proceeds as follows. While its current search point has fitness at 
least n, it finds the local optimum z using Randomized Local Search (RLS). This 
takes expected time 0(n log n). From z it jumps to the starting point z° = z of 
the path P(z, i ). The algorithm now follows the path by using again RLS but 
accepting an offspring if and only if it increases the parent's fitness by exactly 
1 or if the offspring's fitness is 2n + 1. In particular, in this phase the algorithm 
rejects any search point with fitness between n and 2n. Since this algorithm 
needs time 0{n) to advance one step on the path, and the path has length 
0{n), it has expected runtime 0{n'^). 

For the lower bormd we again extend the class of elitist black-box algo¬ 
rithms to a larger class A that allows to apply Yao's Principle. After an algo¬ 
rithm in A has sampled its first search point, we distinguish two cases. If the 
search point has fitness at most n + n/4, then the algorithm may access the po¬ 
sition of the global optimum (and thus, terminate in one more step). If the first 
search point has fitness larger than n + n/4, then the algorithm may access the 
position of the local optimum z. Moreover, it may access a counter that tells it 
how many steps it has performed so far. Apart from that, it may only access 
(one of) the best search point(s) it has found so far, and its fitness. Then A is 
the set of all algorithms that can be implemented with this additional informa¬ 
tion. In this way, every randomized algorithm in ^ is a convex combination 
of deterministic ones, so that we can apply Yao's Principle. So let A G ^ be a 
deterministic algorithm, and consider the uniform distribution on F. 

If the first search point has fitness 2n + 1 or at most n + n/A, then A is done 
after one query or it can terminate in at most one additional step, respectively. 
However, by the Chernoff bound these two events happen only with proba¬ 
bility so from now on we assume that the first search point has fitness 

larger than n + n/4. Observe that by the accessible information the algorithm 
can determine the OneMax value OM 2 (x) for all x G {0,1}”. In particular, for 
every search point of larger fitness except for z^ the algorithm can predict the 
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fitness value without querying it. On the other hand, if it queries a search point 
of lower fitness, then it is not allowed to keep its fitness value. Thus A can¬ 
not obtain additional information about / except by querying the optimum 
Since was chosen uniformly at random, all search points in distance i = nl^: 
from = z have the same probability to be the global optimum. Hence, A 
needs in expectation at least („" 4 ) /2 = queries to find the optimum. □ 

Remark 7. A similar statement as the one in Theorem 6 holds also for ranking-based 
algorithms if we slightly increase the memory of the algorithms regarded. Indeed, there 
exists a unary unbiased (2+1) memory-restricted ranking-based algorithm optimizing 
T in expected 0{nf) function evaluations. Regard, e.g., the algorithm that maintains 
throughout the second phase a search point of fitness n -t 1 and that accepts an 
offspring y of z^ if and only if the fitness ofy is larger than that of z^ but smaller than 
that of (in which case y = z^'^^). Then z^ is sampled (but not accepted into the 
population, see Remark 8) after 0{nf) steps. 

On the other hand, the (2+1) elitist black-box complexity is still exponential, since 
with high probability the first two search points the algorithm samples have fitness 
n/2 -t o(n). 

Remark 8. As indicated in Remark 7 it can make a crucial difference for (non-elitist) 
black-box algorithms if we only require them to sample an optimum or whether we 
require the algorithm to accept it into the population. For example, the algorithm 
described in Remark 7 does not accept the optimum when finding it. 


4 Combining Unbiased and Elitist Black-Box Mod¬ 
els 

In this section we demonstrate that apart from providing more realistic lower 
bounds for some function classes, the elitist black-box model is also an inter¬ 
esting counterpart to existing black-box models. Indeed, we show that some of 
the unrealistically low black-box complexities of the unbiased black-box model 
proposed in (Lehre and Witt, 2012) disappear when elitist selection is required. 

More specifically, we regard the unary unbiased (l-i-l) elitist black-box com¬ 
plexity^ of Jump functions, which (in line with (Doerr et al., 2014a)) we define 
in the following way. For a parameter k the function JUMP^. assigns to each 
bit string x the function value JUMPj,(x) = OneMax(x) if OneMaxJcc) G 
{0} U {fc -I- 1,..., n — fc — 1} U {n} and JUMP;.(x) = 0 otherwise. Despite the fact 
that all common search heuristics need ) fitness evaluations to optimize 

this function, the unary imbiased black-box complexity of these fimctions are 
surprisingly low, see Table 1 for a summary of results presented in (Doerr et al., 
2014a) ((Doerr et al., 2011b) for k = 1). Interestingly, even for extreme jump 
functions in which only the fitness value n/2 is visible and all other OneMax 

®That is, the complexity with respect to all (1+1) elitist black-box algorithms for which 
the sampling distributions in line 7 of Algorithm 1 are unbiased in the sense of Lehre and 
Witt (Lehre and Witt, 2012). 
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Model 

range of fc 

unary unbiased 

elitist unary unbiased 

Constant Jump 

1 < fc = 0(1) 

0(n logn) 


Short Jump 

fc = 0(n^/^“'^) 

0(nlogn) 


Long Jump 

fc = (1/2 — e)n 

0{n^) 


Extreme Jump 

k = n/2 — 1 

0{n^L^) 

0(2"/v^) 


Table 1: Comparison of the imary unbiased black-box complexities of JUMP^, 
with the respective (Las Vegas and Monte Carlo) elitist ones for different 
regimes of k. 


values are replaced by zero, polynomial-time unary rmbiased black-box algo¬ 
rithms exist. It is thus interesting to see that the situation changes dramatically 
when the algorithms are required to be elitist, as the following theorem shows. 

Theorem 9. For k = 0 the (Las Vegas and Monte Carlo) unary unbiased (1+1) 
elitist black-box complexity of the jump function JUMP;, is 0(nlogn). For all 1 < 
fc < n/2 — 1 ft is 0((j,"^)). In particular, for k = w(l) the black-box complexity is 
superpolynomial in n and for k = Q{n) it is exponential. 

Proof of the Upper Bound in Theorem 9. For any constant k the upper bound is 
achieved by the simple (1 -I-1) EA (Droste et al., 2002). For general k, consider 
the algorithm A that produces an offspring as follows. With probability 1/3 the 
offspring is a search point imiformly at random from {0,1}”/ with probability 
1 /3 the algorithm flips exactly one bit (uniformly at random), and with proba¬ 
bility 1/3 it flips exactly fc -I- 1 bits (also uniformly at random). The offspring is 
accepted if its fitness is at least the fitness of the current search point. This algo¬ 
rithm finds a point of positive fitness in expected time 0(y/n) since it produces 
random search points with probability 1 /3 and each such uniform sample has 
One Max value n/2 wifh probability 0(1/^^). Then with high probability it 
increases the fitness to n — fc — 1 in at most 0(n log n) steps by one-bit flips 
(and possibly (fc -I- l)-bit flips). Afterwards, since there are search points 
in distance k -h 1, the algorithm needs in expectation at most 3(j,"^) steps to 
find fhe opfimum. This proves the upper bound for the Las Vegas complexity, 
which in turn implies the upper bound for the Monte Carlo complexity. □ 

It remains to prove the lower bound. For fc = 0 it follows 

from (Lehre and Witt, 2012, Theorem 6). For general fc, as an intermediate step, 
we show the following general result. 

Theorem 10. Assume that f is a function with a unique global maximum Xopt, and 
assume further that a unary unbiased (1+1) elitist black-box algorithm A is currently 
at a search point x f Xopt- Let 0 < d < n/2, and let dist denote the Hamming 
distance. Let 

S := {x' f Xopt I distja:/ Xgpt} < dor distjcc', Xgpt} > n — d} 
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he the distance-d neighborhood of Xopt and of its bitwise complement If all search 

point in S have fitness less than f{x), then A needs in expectation at least 
additional queries to find the optimum. Moreover, for every a > 0, the probability that 
it needs at most additional queries is at most a. 

Proof. Since A is elitist, it can never accept a point in S. Therefore, in every 
subsequent step before finding the optimum, it will be in some search point y 
with distance d' £ [d+l,n — d — 1] from the optimum. If an unbiased mutation 
has some probability p to produce Xopt from y, then every other search point in 
distance d' has also probability p to be the offspring. In particular, since there 
are (^,) such points, p{^,) equals the probability that the offspring has distance 
d' of y, which is at most 1. Hence, at any point the probability to sample the 
optimum in the next step is at most Therefore, A needs in 

expectation at least steps to find the optimum. Moreover, by the union 
bound the probability that A needs less than steps is at most a. □ 

Proof for the Lower Bound in Theorem 9. Assume for simplicity that n is a power 
of 2. In order to apply Yao's Principle (Lemma 2), we allow the algorithm to 
remember its complete search history. We then regard a deterministic algo¬ 
rithm A on a rmiformly chosen JUMP;, frmction. That is, the target string z of 
the OneMax function underlying the JUMP^, frmction is chosen from {0,1}" 
uniformly at random. 

It is known that we can partition the hypercube {0,1}” into 2”/n sets 
Si,..., S' 2 n /n of size n each such that for each i G [2"/n] the pairwise distance 
between any two points in Si is exactly n/2 (e.g., the cosets of the Hadamard 
code as in Arora and Barak (2009)). 

Let i be the index of the set containing the target string z, i.e., z G Si := 

,..., s”}. Regardless of the jump-size k, each search point in Si has positive 
fitness. Indeed, for each j either we have = z (in which case the fitness of 
equals n) or the distance and thus the fitness of s-l to z equals n/2. Since z is 
chosen uniformly at random, the probability that the first search point of set Si 
that the algorithm A queries is the target string z equals 1/n. This shows that 
with probability at least 1 — 1/n the first search point with positive fitness that 
A queries is not the optimum. But then Theorem 10 tells us that the algorithm 
needs at least additional steps in expectation, and at least with 

probability at least a. This proves the claim. □ 


5 Conclusions 

We have introduced elitist black-box complexity as a tool to analyze the perfor¬ 
mance of search heuristics with elitist selection rules. Several examples provide 
evidence that the elitist black-box complexities can give a much more realis¬ 
tic estimation of the expected runtime of typical search heuristics. We have 
also seen that some unrealistically low black-box complexities in the unbiased 
model disappear when elitist selection is enforced. 
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We have also introduced the concept of Monte Carlo black-box complexi¬ 
ties and have brought to the attention of the community the fact that these can 
be significantly lower than the previously regarded Las Vegas complexities. 
In addition, it can also be significantly easier to derive bormds for fhe Monte 
Carlo black-box complexities, see (Doerr and Lengler, 2015b). Both complexity 
notions correspond to runtime analysis statements often seen in the evolution¬ 
ary computation literature and should thus co-exist in black-box complexity 
research. 

While we regard in this work toy-problems, it would be interesting to an¬ 
alyze the influence of elifisf selection on fhe performance of algorifhms in 
more challenging optimization problems. Our findings enliven fhe question 
for which problems non-elitist selection like tournament or so-called fitness- 
dependent selection can be beneficial, initial findings for which can be found 
in (Oliveto and Zarges, 2015; Friedrich et al., 2009). Negative examples are pre¬ 
sented in (Oliveto and Witt, 2014; Happ et al., 2008; Neumann et al., 2009). 
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